CNN 모델의 그래디언트 플로우 분석과 성능 비교

박슬기; 홍명덕; 조근식; Seulgi Park; Myungduk Hong; Geunsik Jo; 노설현; Seol-Hyun Noh

연구문헌

국내 논문지

홈 > 연구문헌 > 국내 논문지 > 한국정보과학회 논문지 > 정보과학회논문지 (Journal of KIISE)

정보과학회논문지 (Journal of KIISE)

Current Result Document : 10 / 38 이전건 다음건

한글제목(Korean Title)	CNN 모델의 그래디언트 플로우 분석과 성능 비교
영문제목(English Title)	Gradient Flow Analysis and Performance Comparison of CNN Models
저자(Author)	박슬기 홍명덕 조근식 Seulgi Park Myungduk Hong Geunsik Jo 노설현 Seol-Hyun Noh
원문수록처(Citation)	VOL 48 NO. 01 PP. 0100 ~ 0106 (2021. 01)
한글내용 (Korean Abstract)	CNNs(Convolutional Neural Networks)은 컴퓨터 시각 인식(Computer vision)과 자연어 처리(Natural language processing) 분야에서 뛰어난 성능을 보여 가장 널리 사용되고 있는 딥러닝 방법이 다. CNNs은 입력데이터에 컨볼루션 레이어를 연속적으로 적용하는 구조를 통해 입력 데이터의 locality와 correlation을 효과적으로 추출하여 CNNs의 깊이가 깊어질수록 신경망의 성능이 향상되어왔다. 그러나 CNNs의 깊이가 깊어질수록 신경망의 정확도가 반드시 높아지는 것은 아니다. 그래디언트 소실 문제 (Gradient vanishing problem)으로 인해 weighted layers의 가중치들이 수렴하지 않는 현상이 발생할 수 있기 때문이다. 이에 본 연구에서는 VGGNet 모델, ResNet 모델, DenseNet 모델의 gradient flow를 분석하고 비교함으로써 각 모델의 error rate 성능에 차이가 나는 근거를 도출하였다.
영문내용 (English Abstract)	Among the various deep learning techniques available, convolutional neural networks (CNNs) are widely used due to their superior performance in the fields of computer vision and natural language processing. CNNs can effectively extract the locality and correlation of input data using structures wherein convolutional layers are successively applied to the input data. The performance of neural networks has generally been improved as the depth of CNNs has increased. However, an increase in the depth of a CNN is not always accompanied by a corresponding increase in the accuracy of the neural network. This is because the gradient vanishing problem may occur, thereby causing the weights of the weighted layers to fail to converge. Accordingly, in the present study, the gradient flows of the VGGNet model, ResNet model, and DenseNet model were analyzed and compared, and reasons for the differences in the error rate performances of the models was derived.
키워드(Keyword)	이상 탐지 광학 흐름 객체 중심 딥러닝 인공지능 anomaly detection optical flow object-centric deep learning artificial intelligence CNN 그래디언트 소실 그래디언트 플로우 성능 비교 오류율 CNN gradient vanishing problem gradient flow performance comparison error rate
파일첨부	PDF 다운로드